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Abstract 

The energy of a finite system thermally connected to a thermal reservoir may fluctuate, while 
the temperature is a constant representing a thermodynamic property of the reservoir. The fi- 
nite system can also be used as a thermometer for the reservoir. From such a perspective the 
temperature has an uncertainty, which can be treated within the framework of estimation theory. 
We review the main results of this theory, and clarify some controversial issues regarding tem- 
perature fluctuations. We also offer a simple example of a thermometer with a small number of 
particles. We discuss the relevance of the total observation time, which must be much longer than 
the decorrelation time. 
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I. INTRODUCTION 



In equilibrium thermodynamics there is a one-to-one relation between the energy of a 
macroscopic system, which is not at a phase transition, and its temperature. The role of 
the temperature is to control the transfer of energy between the system and other systems 
thermally coupled to it. Thermal (heat) reservoirs are assumed to have infinite energy and 
are characterized only by their temperature. A finite system in thermal contact with a 
thermal reservoir will attain the temperature of the reservoir, which is a device designed to 
bring a body to a well defined temperature. 

If we know the energy U of a thermodynamic system A in equilibrium, we can adopt two 
perspectives: 

1. A plays the role of a thermometer and can be used to determine the temperature, 
T{U), of a thermal reservoir IZt with which the system is, or had been, in contact. 

2. A performs the role of a thermal reservoir and can be used to assign the same tem- 
perature T(U) to all its subsystems. We may assume that A has been brought into 
contact with an appropriate reservoir to acquire the given U and T and then isolated, 
thus keeping its subsystems in equilibrium at the temperature T. 

The microscopic aspects of statistical mechanics alters these perspectives by introducing 
fluctuations of physical quantities in equilibrium so that a finite system in equilibrium with 
a thermal reservoir at temperature T does not have a well defined energy, but a well definite 
distribution of energy, P(E,T), and a well defined average energy, (E) = U(T). The 
temperature of the reservoir becomes the parameter that controls the distribution of the 
energy of the finite system. 

Energy fluctuations are practically unobservable for macroscopic bodies. However, if 
systems of all sizes are considered, there is a conceptual problem with respect to these 
perspectives. If contact with the reservoir at temperature T does not guarantee a unique 
energy of a system, but can determine only a distribution of energies for system, then if 
the isolated system A has a given energy, how can we be sure that A was in contact with 
reservoir 1Z T and not with reservoir 1Z T ' , with T' ^ T? To what extent can we assume that 
A and its subsystems are in equilibrium at temperature T and not at temperature T'l 
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In statistical mechanics the (almost) one-to-one relation between U and T is recovered for 
macroscopic bodies, because of the relative smallness of the energy fluctuations. However, 
the problem of assigning a temperature to a given energy is relevant for non-macroscopic 
bodies. 

Our understanding of temperature fluctuations has a long history. Einstein showed that 
the statistical properties of macroscopic variables can be determined in terms of quantities 
determined in thermodynamic equilibrium. In Sec. [TT] we review the basic concepts of the 
Einstein approach. Temperature fluctuations have a special status in fluctuation theory. The 
Einstein theory yields formal expressions for ((5T) 2 ). Some authors suggest that temperature 
and energy are complementary, similar to position and momentum in quantum mechanics.- 
In contrast, others have stressed the contradictory nature of the concept of temperature 
fluctuations^: in the canonical ensemble, which describes contact with a thermal reservoir, 
the temperature is a parameter, so it cannot fluctuate. For a discussion of temperature 
fluctuations see Refs. y|j4. 

Mandelbrot has shown that the problem of assigning the temperature of the thermal 
reservoir, to which a system had been in thermal contact, can receive a satisfactory answer 
within the framework of estimation theory- This analysis shows that as a system becomes 
smaller, the second perspective gradually loses its meaning (a small system cannot be con- 
sidered as a thermal reservoir), and the first perspective maintains its validity, because a 
small system can be used as a thermometer by repeating the measurement of its energy a 
suitable number of times. 

In the usual course on statistical physics the theory of fluctuations is explained using the 
energy or number of particles. Students might gain the impression that the same approach 
can be applied to other quantities such as the temperature. This issue is controversial, 
because temperature is usually a parameter and not a fluctuating quantity. With the advent 
of small systems such as nanosystems and biomolecules, a fluctuating temperature is often 
discussed in research. 

Our paper is an effort to explain the possible pitfalls of generalizing fluctuation theory to 
the wrong quantities, as well as to illustrate a meaningful way of introducing temperature 
fluctuations. We first briefly review the contribution by Mandelbrot to understanding tem- 
perature fluctuations.- We will also discuss a model thermometer, which allows for a detailed 
understanding of the problem. We will see that, even in a system with few degrees of free- 
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dom, the temperature due to contact with a thermal reservoir is a well defined quantity that 
can be determined to arbitrary accuracy if a enough measurements are made. However, 
the observation time must be much longer than the decorrelation time of the underlying 
dynamics so that the number of independent measurements is sufficient. 

The paper is organized as follows. In Sec. [IT]we review the Einstein theory of fluctuations 
and discuss the origin of the problem. Section II I II is devoted to a discussion on the relation 
between statistics and fluctuations (uncertainty) of the temperature. In Sec. IIVI we present 
a model for a thermometer and illustrate a practical way to determine the temperature. In 
such a model, as well as for any thermometer, we have an indirect measurement of T, which 
is a statistical estimator, obtained by successive measurements of an observable. 



II. REVIEW OF THE EINSTEIN THEORY OF FLUCTUATIONS 

For consistency we briefly recall the Einstein theory of fluctuations,- focusing on the issue 
of temperature fluctuations. 

Assume that the macroscopic state of a system is described by n variables, «i, . . . , a n , 
which depend on the microscopic state X: aj = gj(X.), j = l,...,n. Denote by V the 
parameters that determine the probability distribution function of the microscopic state X. 
For example, in the canonical ensemble V c = (T, V, N) and in the microcanonical ensemble 
V m = (E, V, N). The probability distribution function of {otj} is given by 

/n 
p(X,V)Y[5( aj - gj (X))dX., (1) 
i=i 

where p(X, V) is the probability distribution function of X in the ensemble with parameters 
V. In the canonical ensemble we have 

P(a 1 ,...,a n ) = e-4^'-^l^)-^)], (2) 

where /3 = 1/ksT, ks is Boltzmann's constant, F(V C ) is the free energy of the system with 
parameters V c , and F(a\, . . . , a n \V c ) is the free energy of the system with parameters V c 
and macroscopic variables a±, . . . ,a n : 

/n 
J]5(a,-(7 i (X))e- /3H(x) dX. (3) 

3=1 
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In the microcanonical ensemble we have 



P(«i, 



e [S(a 1 ,...,a n \V m )-S{V m )]/k B = e <5S(ai,...,a„)/fe s 



(4) 



which is the Boltzmann-Einstein principle, where S is the entropy.- 

For macroscopic systems it is natural to assume that the fluctuations with respect to 
thermodynamic equilibrium are small. Therefore we can expand 8S(ai, at2, ■ ■ ■ , at n ) in a 
Taylor series about the mean values {aj}, which coincide with their values in thermodynamic 
equilibrium {a*}: 

SS(ai, . . . , a n ) ~ }] SajAijSaj (5) 



where 5a j = ctj — a* } and 



Aij 



d 2 S 



dctjda. 

Therefore small fluctuations are described by a multivariate Gaussian probability distribu- 
tion function: 



det A 

{2-kUbY 



exp 



i,j ) 



and 



(SaiSaj) = k 



B 



A" 1 



(7) 



(8) 



The entries of the matrix are calculated at equilibrium. The matrix A must be positive 
(that is, all its eigenvalues must be strictly positive), which means that the difference of the 
entropy with respect to equilibrium must be negative. The well known expression for the 
energy fluctuations 



((E - (E)) 2 ) = k B T 2 C, 



(9) 



where Cy = d(E)/dT is the heat capacity at constant volume, is a special case of Eq. ([8]). 

The A^ are functions of quantities evaluated at thermodynamic equilibrium, so that we 
can write 5S as a function of different variables. For instance, we can express S as function 
of T and V£ 

C v 



5S 



-(5T) 2 + ^ 
2T 2 K J 2TdV 



(svy. 



By using Eqs. @ or ([8]), we obtain 



((ST) 2 ) 



k B T 2 
C v ' 



(10) 



(11) 



Equation (jTOj) is correct if we consider S as a state function. In contrast, Eq. (iTTj) follows from 
Eq. (jlj) with SS related to the probability distribution function of fluctuating quantities, and 
hence the derivation of Eq. (ITTj) is formal (in the sense of the mere manipulation of symbols) 
and its meaning is not clear. Note that in Eq. (|10p 5T = 5(dE/dS). 

For a system whose energy fluctuates about the value (E) (such that dE / 8S\e=(e) = T, 
where T is the temperature of the thermal reservoir) we can think of T = 8E/dS\E as the 
temperature T 7^ T of this system, if it has been found with energy E 7^ (E). However, we 
can also think of T as the best guess for T if the energy E has been measured. It is tempting 
to say that because temperature is proportional to the mean kinetic energy, its fluctuations 
are proportional to fluctuations of the kinetic energy. This pointis a delicate one, which will 
be considered in Sec. [V] 

If we assume Eq. (TTTI) and use Eq. © , we have 



((5Tf)((8Ef) = k 2 B T* or ((8py)((5E) 2 ) = l. (12) 

Equation (fl2l) can be interpreted as a "thermodynamic uncertainty relation" formally similar 
to the Heisenberg principle. Some authors discuss a "thermodynamic complementarity" 
where energy and (3 play the role of conjugate variables^ (see Sec. IIIIDI) . 

Other authors, such as Kittel,-^ claim that the concept of temperature fluctuations is 
misleading. The argument is simple: temperature is just a parameter of the canonical 
ensemble, which describes the statistics of the system, and therefore it is fixed by definition. 

Some authors wonder about the meaning of the concept of temperature in small systems.- 
For instance, Feshbach 9 considered that for an isolated nucleus consisting of iV = O(10 2 ) 
nucleons (neutrons and protons), we expect from Eq. (ITTj) a non- negligible value of 5T/T. 
However, from experimental data we observe (in Feshbach's words) that the empirical pa- 
rameter to be identified with "does not have such a large uncertainty." McFee^ wrote 
that "The average temperature of a small system of constant specific heat connected to a 
thermal reservoir turns out to be different from that of the reservoir," and considered the 
fluctuations of 0(E) = dS(E)/dE. Because such a quantity is a function of energy, its 
fluctuations are well defined and can be studied. He found 

m?) = (13) 

which is equivalent to Eq. (1121 . 



III. STATISTICS AND STATISTICAL MECHANICS 



In this section we illustrate the approach of Mandelbrot- to statistical mechanics and 
review some basic concepts of statistics. 

A. Thermal Reservoirs 

A thermal reservoir is a system with very large (practically infinite) energy, such that 
a system with finite energy which is put in thermal contact with the reservoir comes to 
equilibrium at the temperature T of the reservoir. In thermodynamics we consider only 
macroscopic bodies, which are those that have a well defined macroscopic energy by being 
in thermal equilibrium with a reservoir. Even in a purely phenomenological context we can 
discuss the fluctuations of the energy of a generic system in thermal equilibrium.-^ However 
in statistical mechanics we can also consider systems with a few degrees of freedom, and 
therefore the fluctuations of the energy can be significant. The distribution of the energy E 
of a system that is in equilibrium with a thermal reservoir of temperature T is given by the 
Boltzmann-Gibbs density function 



where G(E) is the density of states and Z(T) is the partition function. 

When a system is in equilibrium with an thermal reservoir, we have two mutually exclusive 
situations: either we know the temperature of the reservoir and can describe the energy 
distribution of the system; or we do not know the temperature of the reservoir, and can 
determine it from the energy distribution of the system. The latter situation is called the 
inverse problem. For the inverse problem we can use the tools of estimation theory, which 
makes it possible to use the available data (in this case a series of energy values) to evaluate 
an unknown parameter (in this case T). 

If we assume that the equilibrium properties of an isolated system, whether it has been 
isolated from a thermal reservoir or not, are described by the microcanonical probability 
density, an answer to the inverse problem is also an answer to the question: is it possible 
to assign a temperature to an isolated system with a given energy? The origin and the 
importance of the question resides in the following considerations. For an isolated system 



P(E,T) 




(14) 



Z(T) 
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composed of N non-interacting subsystems, we can calculate average values of observables 
by means of the probability density 

1 / N-l \ 

-jp^gM • • -giuN-dg [un = E - ih , (15) 



G (Ey 



where «j is the energy of subsystem i, E = J2i u i is the energy of the system, and g(u) is 
the energy density of a single subsystem, so that 



G (E) 



■■ j g(ux) ■ ■ ■ g{u N )5 Ui - E j dux--- du N . (16) 



Such calculations are usually very difficult. However, for systems with a large number of 

subsystems^ we can approximate the probability density by a product of factors from the 

canonical ensemble: _ 

jj ff(M i )exp(-M i /fcT) ^ 

i Z i{T) 

where T is the temperature associated with the variable E. We have replaced non- 
independent variables by independent ones and have replaced E, the energy of the isolated 
system (and the parameter of the original distribution), by the common temperature (and 
the parameter of the approximate distributions) of its subsystems in thermal equilibrium. 
Note that this question must be posed in statistical mechanics, while in thermodynamics 
the functional relation between energy and temperature is an equation of state and does not 
call for a microscopic explanation. 



B. Estimation Theory 

We recall here a few basic concepts from estimation theory.—^ Consider a probability 
density function f(x, (3) of the variable x, which depends on the parameter (3, together with 
a sample of n independent events (x\, . . . , x n ), governed by the probability density /, so 
that the probability density of the sample is 

L(x x , ...,x n ,0) = f(x x , /?)••• f(x n , (3). (18) 

We would like to estimate the unknown parameter from the values {x{\. For this purpose 
we have to define a suitable function of n variables, 0(x\, . . . , x n ), to obtain the estimate of 
f3 from the available information. The quantity (3 is, by construction, a random variable. 
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We can calculate, for instance, its expected value and its variance. We assume that f3 is an 
unbiased estimate of 0, that is, ((3) = (3. It is clear that the usefulness of an estimating 
function is tightly linked to its variance. 

Once the function /3(xi, . . . ,x n ) has been introduced, each sample ( also 
be specified by giving the value of /3 for the particular sample and the values of n — 1 
other variables {£} that are necessary to specify the point on a surface of constant f3. In 
other words, a change of variables (xi, . . . , x n ) — > £i, . . . , can be made, so that the 
probability of a sample may be written as 

L(xi, ...,x n )dxi--- dx n = F(/3, p)h(£i, . . . , £ n -i\P, P) d(3 d^--- d£ n -i, (19) 

where F(f3,/3) is the density of the variable (3 (depending on (3) and h({£i}\/3, (3) is the 
density of the variables {&}, conditioned by the value of (3 and, in general, depending on 
the parameter (3. 

Given certain general conditions of regularity, we can obtain the Cramer-Rao inequality 13 
for unbiased estimators: 

J {p-pfF0)d$> jn J (^ln/(x,/3)) 2 /(x,/3)^| , (20) 

where the denominator on the-right hand side of Eq. ( 1201) is known as the Fisher 
information, 13 which gives a measure of the maximum amount of information we can ex- 
tract from the data about the parameter to be estimated. This inequality puts a limit on 
the ability of making estimates, and also suggests that the estimator should be chosen by 
minimizing the inequality. When the variance of (3 is the theoretical minimum, the result (3 
is an "efficient estimate."— We have followed the convention of distinguishing between an 
efficient estimate, which has minimum variance for finite n, and an asymptotically efficient 
estimate, which has minimum variance in the limit n — > oo. 

Starting from the probability of a given sample, Eq. ffl8|) . the method of maximum 
likelihood estimates the parameter (3 as one that maximizes the probability, or is a solution 
of the equation 

^lnL(x 1 ,...,x ri ,/3) = 0. (21) 

Under certain general conditions on some derivatives of f(x, (3) with respect to (3,— Eq. (I2ip 
has a solution that converges to (3 as n — > oo. The solution is asymptotically Gaussian and 
is an asymptotically efficient estimate of (3. In other words, there exists a random variable 
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{3(x\, . . . ,x n ) which is a solution of Eq. (12 ip such that a maximum likelihood estimator of /3 
is obtained, whose probability density in the limit n — > oo approaches a normal probability 
density centered about (3 with variance 

|n|(^ln/(x,/3)) 2 /(^/3)^| . (22) 
C. Thermal Reservoirs Again 

We now return to a system in equilibrium with a reservoir of unknown temperature on 
which we have performed a measurement of energy: the system considered in this section is 
a gas of N classical particles. For simplicity, we begin by considering measurements of the 
energy u of a single particle, whose probability distribution we write as 

g(u)exp(-Pu) 

P(%P) = , (23) 

where the parameter (3 is l/k B T and the density of single particle states g(u) is assumed 
to be known. Suppose that we have measured n independent values of particle energy 
(ui, . . . , u n ). We can write 

p / R \ _ fl(Mi)exp(-/^i) g(u n ) exp(-(3u n ) 

or 

p(uu .-»,;) 9{u X^ )G ° m ^~ m (25) 

= h(u 1 ,...,u n ^ 1 \U)P(U,/3), (26) 

where U = J2i u i an d Gq(U) is defined as in Eq. (TIB"]) with iV replaced by n; note that in 
this section U has a different meaning with respect to the introduction. Because P(U, (3) is 
the probability density of measuring a total energy U in n independent single particle energy 
measurements, we see that h(ui, . . . ,u n -x\U), which is the conditional distribution of the 
energy in the sample given the total measured energy, does not depend on (3. We conclude 
that good estimators of (3 can be constructed as a function of the sum of the measured 
energies. 

One possible choice of an estimator is the maximum likelihood estimator for which the 
value of (3 is determined by 

a n 
-^1^(/^=X> ( 27 ) 
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Equation (127)) establishes a one-to-one relation between ft and £V Uj. For large n, the values 
of /3 extracted from Eq. ( |2"T]) are normally distributed around the true value /3, with the 
variance ^ 

|n J (u- {u)) 2 p{u 1 f3)du^ = (28) 

where 

(it) = y uP(u,ft)du, (29) 

and o"^ is the variance of the single-particle energy calculated with the true ft. 

For instance, if the density of states is g(u) oc u v , then from Eq. (IA.2I) the maximum 
likelihood estimate is 

Pmle — jj — , 1<jUJ 

which is not an unbiased estimate because as shown in the Appendix [see Eqs (1A.8l and lA.9j) 

^->=K 1+ ;^TiFi)^ (31) 

As Eq. (13 ip shows, the maximum likelihood estimator is asymptotically unbiased and, from 
general theorems on maximum likelihood estimator [see the end of Sec. IIII Bl and Eq. (1281)]. 
we know that, because a\ = (rj + 1)(3~ 2 , we have for large n 

4 ~ / 2 x (32) 

PMLE 7l(77+l) 

or 



<T$ 1 



(33) 



P y/n(r} + 1) ' 

Therefore we can obtain an estimate of the parameter as accurate as we want by using a 
sufficiently large sample. 

Another estimator for ft is given by the random variable 

% = ^lnGo(CO, (34) 

where Gq(U) is defined in Eq. ffTB]) with replaced by n. Unlike the maximum likelihood 
estimator, the right-hand side of Eq. (l34"j) is an unbiased estimator of ft for any n, but like 
the maximum likelihood estimator, it is not an efficient estimator for finite n. 

For instance, with the density of states given as before, the estimate is (see Eq. (|A.5[) and 
the following discussion) 

o + - 1 

PG = jj , (35) 
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with the variance 



a 




n(r) + 1) - 2 



1 



) 



> 



na„ 



1 



u 



(36) 



For this particular density of states Eq. ( 136]) also shows that 0g becomes asymptotically 
efficient, because it attains the Cramer- Rao lower bound in the limit n — > oo. This behav- 
ior is more general: we can demonstrate that for certain regularity conditions, these two 
estimators are asymptotically equivalent.— 1 ^ 

An important point of the preceding discussion is that, due to the exponential form of 
the canonical ensemble probability density, all of the information about is contained in 
the total energy of an isolated sample. We gain nothing by knowing the distribution of 
this energy among the n elements of the sample. We say that U = Yli u i * s sufficient for 
estimating /3. Therefore we may also argue as follows. 

Instead of n measurements of the molecular energy, we make one measurement of the 
energy E on the macroscopic system with density P(E, (3) = G(E) exp(—(3E)/Z]y((3). G(E) 
is the density of states of the entire system, which reduces to Gq(E) for systems made of 
non-interacting components. The Cramer-Rao inequality becomes 



where a\ is the variance of the canonical energy of the macroscopic body. 

For an ideal gas of iV identical particles, a 2 E = Na 2 , and Eq. (1371) becomes cr| > 1/Na 2 . 
With regard to the determination of /3, a single value of the macroscopic energy contains 
the same information as N microscopic measurements. 

We know that a non-ideal gas of N identical particles with short-range interparticle 
interactions behaves (if not at a phase transition) as if it were composed of a large number, 
N e ff oc N, of (almost) independent components, and a\ ~ N e ga 2 , where a 2 is the variance 
of one component. For instance, consider a system of iV particles in a volume V with a 
correlation length £ = (cV/N) 1 ^ 3 , where c>l indicates strong correlations. We have N c g ~ 
V/£ 3 = c _1 iV. Thus, even if n = 1 in Eq. ( )37j) . that is, we perform a single measurement 
of energy, the variance of E, which is the energy of a macroscopic system, is extensive and 
the variance of f3 may be small. We have <r| > 1/N e sa 2 , with N e s oc N ^> 1. By looking 
at E as the result of iV e fj elementary energy observations, our preceding considerations can 
be applied here with N eS playing the role of n. In particular, the asymptotic properties for 




(37) 
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large N e g of the two estimators are preserved, and the estimates of (3 obtained by the two 
expressions 

-^-\nZ N ((3)^ =E, (38a) 

d/3 /3 M le 

and 

h = ^\nG{E) (38b) 

approach the same value for N e Q ^> 1, a condition that is verified for macroscopic bodies. 
Therefore, for a macroscopic system, we can obtain a good estimate of /3 even with a single 
measurement of its energy, and we can assign a reliable value of /3 to an isolated macroscopic 
system. 

We have given an estimation theory justification of the standard definition of the temper- 
ature in statistical mechanics either in the canonical or microcanonical ensemble by means 
of Eq. 



D. Uncertainty relations in statistical mechanics? 

From our discussion we see that the fluctuations of the random variables /?mle and /3q 
when n ^> 1 are approximately Gaussian with a variance l/(na^). The fluctuations of the 
total energy of the sample U = Y^i u i also become Gaussian (by the central limit theorem) 
with variance na\. Therefore, in this limit, we have cr-crf; = 1. Is there a deeper meaning? 
As can be seen, for instance, for g(u) oc u v , from Eqs. fl30|) and fl35|) . /?mle and /3q are 
functions of U/n, and therefore, from crj) oc n a~ oc cr^/n 2 ~ 1/n. 

Although the Rao-Cramer inequality, Eq. fl3T|) . is formally similar to Eq. ( I12p . which was 
obtained in the framework of Einstein's theory, the analogy is inexact and misleading. In 
mathematical statistics the quantity <t| measures the uncertainty in the determination of 
the value of (3 and not the fluctuations of its values. 



IV. MODEL THERMOMETER 



To illustrate the ideas we have discussed in we discuss the following mechanical model for 
a thermometer. A box is filled with iV non-interacting particles of mass m. On the top of the 
box is a piston of mass M which can move without friction in the x direction. Although the 
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box is three-dimensional, only the motion in the x direction is relevant because we assume 
that the particles interact only with the piston. The other directions are decoupled from 
x, independently of their boundary conditions. The one-dimensional Hamiltonian of the 
system is 

N r> 2 P 2 



2m 2M 



where X is the position along the x axis of the piston, and the positions of the particles Xi 
along the same axis are constrained to be between and X. A force F acts on the piston, 
and in addition there are elastic collisions of the gas particles with the piston. The particles 
exchange energy with a thermostat at temperature T placed on the bottom of the box at 
x = 0. When a particle collides with the ground, it acquires a speed v with probability 
density 17 

P( v )= -^-ve- mv 2/2kBT . (40) 

In the following we set k B — 1, which is equivalent to measuring the temperature in units 
of l/k B . 

The statistical mechanics of the system can be obtained in the canonical ensemble. The 
probability distribution function for the positions of the particles is 

N 

P{x x , . . . , x N , X) = c N H 6{X - Xi)e-^ FX , (41) 

i=i 

where c N = ((3F) N+1 /T(N + 1) = ((3F) N+1 /N\. We integrate over the positions of the 
particles and obtain 

P(X) = -L([3F) N+l X N e-P FX . (42) 



The mean value (X) is 



(A') = (43, 



The probability distribution function for X obtained by numerical simulations of the system 
is in agreement with Eq. ( 1421) even for small values of N, as can be seen in Fig. [H 

In the following we will estimate the temperature from a single (long) time series of the 
simulations of the piston position. This procedure is common both in numerical and in real 
experiments. Therefore it is necessary to consider the dynamical statistical properties of our 
system. For JV>1 and M/m > 1 we expect that the variable 5X = X — (X) is described 
by a stochastic process.— Figure [2] shows the typical behavior of SX(t) for different values 
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of N. For our purposes (in particular, for TV not too large) it is not necessary to perform an 
accurate analysis. 

We now discuss a measurement of the temperature with its uncertainty, regardless of the 
number of degrees of freedom. We assume that only the macroscopic degree of freedom, the 
position of the piston, is experimentally accessible. We want to determine the temperature 
and its uncertainty by a series of measurements. From Eq. ([4*3]) the temperature can be 
estimated as 

f =NTl> < 44 > 

where X is an estimate of the average piston position. Assume that we have M independent 
measurements . . . , Because of the peculiar shape of the probability distribution 

function (1421) (it is an infinitely divisible distribution 1 -), we have that the variable Xj^ = 
+ . . . + X^)/M has a probability distribution function of the same shape, where N 
is replaced by NAT in Eq. (HSl) . The variance of Xjy is a^/Af, because the values of X 
are independent. From Eq. f l4"2]) we have a 2 ^ = (N + 1)/(3 2 F 2 [see the Appendix for the 
calculation of moments of the distribution (l42l)]. and therefore 

2 IN + l 
°^ = N7J>W (45) 



An analysis of the distribution ff42|) shows that the Cramer-Rao lower bound for the 
estimators of T is 

JHW+T)' (46) 

so that we can verify that the random variable 



^t(^) (47) 



is an unbiased and efficient estimator for every Af. 

We now discuss how to determine the temperature and its uncertainty from a time series 
{^i\ J iLii where X{ = X(i5t) and 8t is the sampling time interval; AfSt is the total observation 
time. The procedure we will describe is also valid for non-independent data {Aj}, and 
depends only on the validity of Eq. fj4"5|) and not on Eq. (j32D • 

The variance cr|, of the estimator T, given by Eq. ( 14T)|) is of order ~ which can be 
non-negligible for single measurements on small systems. As described in Sec. 1111} it can 
be arbitrarily reduced by increasing the number Af of measurements. To clarify this point, 
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we numerically computed the variance o% for several values of iV as a function of Af. In 
general, the data are correlated, and a correlation time r must be estimated numerically. 
The simplest way is to look at the shape of the correlation functions of the observables 
of interest. If 5t < r, the effective number of independent measurements is approximately 
AfeS = AfSt/r. By plotting iVcr? versus Af e s we expect that the dependence on N disappears, 
resulting in a collapse of the curves (see Fig. HJ). Note that for large times, the uncertainty 
goes to zero as 1/Af e s, i n agreement with Eq. (H51) . We will see that it is sufficient to know the 
typical time scales of the process. Namely, we need to determine the correlation functions 
(8X(t)SX(0)) or (6V(t)6V(0)), where 5V = 5X. We can define a characteristic time for X, 
Tx, as the minimum time t such that |Cx(£)| < 0.05, where 

(6X(t)SX(0)) 

Cx{t) ~ (6X*(0)) ■ (48) 
In the same way we can introduce the characteristic time for the velocity autocorrelation Ty 
(see Fig. ED- 

The previous estimate of the temperature is well-posed once we know that Eq. f )43|) holds. 
However, there are other possibilities. For instance we can choose to monitor the velocity 
of the piston instead of the position, and repeat the same analysis. It is straightforward to 
see that the velocity is Gaussian distributed, with variance (V 2 ) = T/M, and only the mass 
of the piston is needed for the estimate. The choice of which estimator is more suitable is a 
matter of convenience. 

For systems exhibiting aging and ergodicity breaking there are model thermometers that 
are very different from the one proposed here and are based on the linear response of the 
system and its comparison with unperturbed correlators.— 1 ^ In this case we may have access 
to "effective" temperatures related to slow, non-equilibrated, degrees of freedom, and the 
problem of uncertainty becomes more complicated, because the effective temperatures may 
be time-dependent.— 



V. CONCLUSIONS 



We have considered the concept of temperature fluctuations and showed that it makes 
sense only when associated with uncertainties of measurement. In a molecular dynamics 
computation at fixed energy it is common practice to look at the fluctuations of the kinetic 
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energy. Because the mean value of the kinetic energy per particle, K, is proportional to the 
temperature, it might be concluded that the fluctuations of K are related to the fluctuations 
of the temperature, ((ST) 2 ) oc ((5K) 2 ). However, it is the random variable K that fluctuates 
from sample to sample, not its mean value. Hence, in the preceding relation we should 
properly write ^(<5T) 2 ^ instead of ((<5T) 2 ), because we are using the random variable K as 
an estimator of the temperature, which as stressed by Kitted, is an unknown but fixed 
parameter. If the conceptual difference between parameters and fluctuating variables is 
ignored, we may obtain suggestive but deceiving interpretations of results such as Eq. (fl2|) . 
Estimation theory gives a precise role to uncertainties in temperature measurements, and 
establishes which properties make a temperature estimator better than others and leads to 
meaningful results such as Eq. fliZUj) . In this framework we can appreciate the different use 
we can make of a system, either as a reservoir or as a thermometer. For the model system 
of Sec. |IV]the relative uncertainty in the temperature estimation is 

AT _ \/4 _ 1 
T ~ T y/tf(N + l)' 

If N 1, even a single estimate, Af = 1, leads to a precise value for T. The system is 
not only an efficient thermometer, but also can be thought of as a reservoir with a definite 
temperature. In contrast, if N is small, a single measurement can lead to a large uncertainty 
in the estimate of temperature, but the uncertainty can be arbitrarily reduced by increasing 
the number of measurements. The small system can be used as a thermometer, in this case 
the dynamics plays a non-negligible role, and the characteristic decorrelation time of the 
relevant variables dictates the effective number of independent measurements. 

For systems with a few degrees of freedom the uncertainty in the temperature can be 
reduced by an adequate amount of data, which is an answer to the problem concerning 
pointed out by Feshbach in Ref. |9. 
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Appendix: Some useful formulas 



To obtain Eq. (!30|) with g(u) = ju 71 (7 independent of u), we have 



— / g(u) exp(—/3u) du = / ^vP exp(— /3u) du (A.l) 
Jo Jo 

/In 77+I 

= 7(73) rfo + 1), (A.2) 

where T(z) (with 2 > 0) is the Gamma function. From Eq. ( 1271) we immediately find 
Eq. pjp. 

We now derive Eqs. (131]) . ( )35|) . and (136|) starting from the power-law density of states 
g(u). First, we show that for the density 

_ G (U)eM-PU) G (U)exp(-pU) 
[ ,P) Z»{p) J?G {U)exj>(-/3U)dU' 1 6) 

we have G$(U) oc f/ n ( r ' +1 ) _1 . If we make the change of variables = xJJ in Eq. ( Fl6|) . we 
obtain 

Go(Eo = 7 ™f/™("+ i ) r ... r x\ x \ . . . ^ ^i^- 1 ) rfxidX2 . . . ( A4 ) 

Jo io t7 
where we have used the relation 5 (aw) = S(w)/a. We make the U dependence explicit and 
write 

G (U) = 7 n f/ n( " +1) - 1 /(n, rf). (A.5) 
We can calculate the averages 

1 \ _ JlT 1 ^- 1 (l/U k )exp(-pU)dU , 



, U k I J U n to+V-i exp(-pU) dU 
by means of the integrals 

POD 

/ U a exp(-[3U)dU = (3- {a+1) T(a + 1). (A.7) 
Jo 

From Eqs. (|30|) . ( 1A.6j) . and ( 1A.7|) and the property ^r(z) = T(z + 1), we obtain 

<^mle> = n( V + ) = nfo + l)^ r( r ( ( ^+ 1 ^ ) 1) ( A - 8 ) 

= 71(77 + 1)/? 1 (A.9) 
77(77 + 1) — 1 



which is Eq. (131]) . 
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From Eqs. flS]) and flA~~5j) we obtain Eq. <^> and (j3 G ) = (3. From Eq. f lA~6|) with k = 2 
we obtain 

J_\ = t fA 10) 

U 2 / (n(77 + l)-l)(n(77 + l)-2)' 1 ' 

By recalling the definition of (3 G in Eq. (I3"o]) . the fact that <r| = (/3 G ) — (/?g) 2 , and (/3c) = (3, 



we arrive at Eq. ( 1361) . 

With similar calculations involving the Gamma function, we arrive at Eq. ( I4"5l) for the 
variance of the variable X N = (X® + . . . + X^)/Af, that is o\jM. If Eq. ([42]) gives the 
distribution of X, its moments are 

( X fc ) = ^ F ^ +1 J™ X N+k e-? FX dX. (A.ll) 

In terms of the variable z = /3FX, we obtain 

v 7 X!(/3F) fe y X!(/3F) fc (/3F) fe v ; 

Therefore, <r| = (X 2 ) - (X) 2 = (X + l)/(/3F) 2 . 
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FIG. 1: (Color online). The probability distribution function of the position X of the piston 
obtained by numerical simulations with F = 10 and T = 1 (dots) for different values of N and M. 
(a) N = 30 and N = 5 with M = 2N. (b) N = 30 with M = N/2. The black lines show the 
analytical result from Eq. (|42|) . Each simulation has been performed up to a time such that each 
particle collided with the piston at least 10 5 times. 
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FIG. 2: (Color online). Time series of the displacement of the piston from its mean value for 
different values of N. The other parameters are M = 10, F = 10, T = 1, and m = 1. One can 
see that, although the qualitative behavior does not change with N, the amplitude of fluctuations 
decreases with N. 
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FIG. 3: (Color online). The autocorrelation function of the position X and velocity V of the 
piston for N = 30, M = 10, F = 10, T = 1, and m = 1. The estimates of the correlation times 
tx and ry are shown. Although the two correlation functions Cx(t) and Cy(t) are different, the 
corresponding characteristic times are of the same order. 
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FIG. 4: The quantity Na^ for different values of N is numerically calculated and plotted as 
function of 7V c fr = Af5t/r. For large times, the uncertainty goes to zero as 1/J\f e s. The parameters 
are 5t = 0.01, M = 10, m = 1, F = 10, T = 1, N = 5, 30, 70, and 200. It is clear that for the 
uncertainty of T, the relevant quantity is N c q which depends both on N and r. 
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